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Abstract 

We study the query complexity of testing for properties defined by read once formulae, as 
instances of massively parametrized properties, and prove several testability and non-testability 
results. First we prove the testability of any property accepted by a Boolean read-once formula 
involving any bounded arity gates, with a number of queries exponential in e and independent 
of all other parameters. When the gates are limited to being monotone, we prove that there 
is an estimation algorithm, that outputs an approximation of the distance of the input from 
satisfying the property. For formulae only involving And/ Or gates, we provide a more efficient 
test whose query complexity is only quasipolynomial in e. On the other hand we show that such 
testability results do not hold in general for formulae over non-Boolean alphabets; specifically 
we construct a property defined by a read-once arity 2 (non-Boolean) formula over alphabets of 
size 4, such that any 1/4-test for it requires a number of queries depending on the formula size. 

1 Introduction 

Property Testing deals with randomized approximation algorithms that operate under low infor- 
mation situations. The definition of a property testing algorithm uses the following components: 
A set of objects, usually the set of strings £* over some alphabet S; a notion of a single query to 
the input object w = (wi, . . . ,w n ) € £*, which in our case would consist of either retrieving the 
length \w\ or the i'th letter Wi for any i specified by the algorithm; and finally a notion of farness, 
a normalized distance, which in our case will be the Hamming distance — farness (w, v) is defined 
to be oo if | io | ^ \v\ and otherwise it is |{i : Wi ^ vi\\/\v\. 

Given a property P, that is a set of objects P C £*, an integer q, and a farness parameter e > 0, 
an e-test for P with query complexity q is an algorithm that is allowed access to an input object 
only through queries, and distinguishes between inputs that satisfy P and inputs that are e-far from 
satisfying P (that is inputs whose farness from any object from P is more than e) while using at 
most q queries. By their nature the only possible testing algorithms are probabilistic, with either 
1-sided or 2-sided error (1-sided error algorithms must accept objects from P with probability 1). 
Traditionally the query "what is |tu|" is not counted towards the q query limit. 

The ultimate goal of Property- Testing research is to classify properties according to their optimal 
e-test query-complexity. In particular, a property whose optimal query complexity depends on e 
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alone and not on the length \w\ is called testable. In many (but not all) cases a "query-efficient" 
property test will also be efficient in other computational resources, such as running time (usually 
it will be the time it takes to retrieve a query multiplied by some function of the number of queries) 
and space complexity (outside the space used to store the input itself). 

Property- Testing was first addressed by Blum, Luby and Rubinfeld [3], and most of its general 
notions were first formulated by Rubinfeld and Sudan |18j . where the investigated properties are 
mostly of an algebraic nature, such as the property of a Boolean function being linear. The first 
excursion to combinatorial properties and the formal definition of testability were by Goldreich, 
Goldwasser and Ron [TT]. Since then Property- Testing has attracted significant attention leading 
to many results. For surveys see [6], [TO], [TO], [17] . 

Many times families of properties are investigated rather than individual properties, and one 
way to express such families is through the use of parameters. For example, fc-colorability (as 
investigated in [11] ) has an integer parameter, and the more general partition properties investigated 
there have the sequence of density constraints as parameters. In early investigations the parameters 
were considered "constant" with regards to the query complexity bounds, which were allowed to 
depend on them arbitrarily. However, later investigations involved properties whose "parameter" 
has in fact a description size comparable to the input itself. Probably the earliest example of 
this is [H], where properties accepted by a general read-once oblivious branching program are 
investigated. In such a setting a general dependency on the parameter is inadmissible, and indeed 
in [TO] the dependency is only on the maximum width of the branching program, which may be 
thought of as a complexity parameter of the stated problem. 

A fitting name for such families of properties is massively parametrized properties. A good 
way to formalize this setting is to consider an input to be divided to two parts. One part is the 
parameter, the branching program in the example above, to which the testing algorithm is allowed 
full access without counting queries. The other part is the tested input, to which the algorithm is 
allowed only a limited number of queries as above. Also, in the definition of farness only changes to 
the tested input are allowed, and not to the parameter. In other words, two "inputs" that differ on 
the parameter part are considered to be oo-far. In this setting also other computational measures 
commonly come into play, such as the running time it takes to plan which queries will be made to 
the tested input. 

Recently, a number of results concerning a massively parametrized setting (though at first not 
under this name) have appeared. See for example [TOl [9] and the survey [TO], as well as [2], 
where such an e-test was used as part of a larger mechanism. 

A central area of research in Property- Testing in general and Massively-Parametrized Testing 
in particular is to associate the query complexity of problems to their other measures of complexity. 
There are a number of results in this direction, to name some examples see [TJ [TO] [8]. In [3] the 
study of formulae satisfiability was initiated. There it was shown that there exists a property that 
is defined by a 3-CNF formula and yet has a query complexity that is linear in the size of the input. 
This implies that knowing that a specific property is accepted by a 3-CNF formula does not give 
any information about its query complexity. In [TO] it was shown that if a property is accepted by 
a read-twice CNF formula, then the property is testable. Here we continue this line of research. 

In this paper we study the query complexity of properties that are accepted by read once 
formulae. These can be described as computational trees, with the tested input values at the leaves 
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and logic gates at the other nodes, where for an input to be in the property a certain value must 
result when the calculation is concluded at the root. 

We prove a number of results. Section [2] contains preliminaries. First we define the properties 
we test, and then we introduce numerous definitions and lemmas about bringing the formulas whose 
satisfaction is tested into a normalized "basic form". These are important and in fact implicitly 
form a preprocessing part of our algorithms. Once the formula is put in a basic form, testing an 
assignment to the formula becomes manageable. 

In Section[3]we show the testability of properties defined by formulae involving arbitrary gates of 
bounded arity. For such formula involving only monotone gates, we provide an estimation algorithm 
in Section [U that is an algorithm that not only tests for the property but with high probability 
outputs a real number rj such that the true farness of the tested input from the property is between 
n — e and rj + e. In Section [5] we show that when restricted to And/Or gates, we can provide a test 
whose query complexity is quasipolynomial in e. 

On the other hand, we prove in Section [7] of in the appendix that these results can not be 
generalized to alphabets that have at least four different letters. We construct a formula utilizing 
only one (symmetric and binary) gate type over an alphabet of size 4, such that the resulting 
property requires a number of queries depending on the formula (and input) size for a 1/4- test. 

We suppply a brief analysis of the running times of the algorithms in Section [BJ One interesting 
implication of the testability results presented here, is that any read-once formula accepting an 
untestable Boolean property must use unbounded arity gates other than And/Or. 
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2 Preliminaries 

We use [k] to denote the set {1, . . . , k}. A digraph G is a pair (V, E) such that E C V x V. For 
every v 6 V we set out-deg(u) = {u € V \ (u, v) € E}. A path is a tuple (ui, . . . ,Uk) € \V\ k 
such that uit ■ ■ ■ ,Uk are au distinct and (uj,Uj+i) € E for every i £ [k — 1]. The length of a path 
(ui, . . . , Uk) € \V \ k is k — 1. We say that there is a path from u to v if there exists a path (ui, . . . , Uk) 
in G such that U\ = u, and Uk = v. The distance from u € V to v € V, denoted dist(w, v), is the 
length of the shortest path from u to v if one exists and infinity otherwise. 

We use the standard terminology for outward-directed rooted trees. A rooted directed tree is a 
tuple (V, E, r), where (V, E) is a digraph, reF and for every v G V there is an unique path from 
r to v. Let u, v E V. If out-deg(u ) = then we call v a leaf. We say that u is an ancestor of v 
and v is a descendant of u if there is a path from u to v. We say that u is a child of v and v is a 
parent of u if (v,u) € E, and set Children(i;) = {w G V \ w is a child of v}. 

2.1 Formulae, evaluations and testing 

With the terminology of rooted trees we now define our properties; first we define what is a formula 
and then we define what it means to satisfy one. 
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Definition 2.1 (Formula) A Read-Once Formula is a tuple <3? = (V, E, r, X, k, B, £), where 
(V, E, r) is a rooted directed tree, £ is an alphabet, X is a set of variables (later on they will take 
values in H), B C \J k<00 {H k \-t £} a set of functions over £ ; and k: V^-BUXUY, satisfies the 
following (we abuse notation somewhat by writing k v for k{v)). 

• For every leaf v G V we have that k v G X U £ . 

• For ewery t> i/iai is no£ a leaf k v £ B is a function whose arity is |Children(i;)|. 

In the case where B contains functions that are not symmetric, we additionally assume that for 
every v G V there is an ordering of Children(u) = (ui, . . . , u&). 

In the special case where £ is the binary alphabet {0, 1}, we say that is Boolean. Unless stated 
otherwise £ = {0, 1}, in which case we shall omit £ from the definition of formulae. A formula 
= (V, E, r, X, k, B, £) is called read k-times if for every i£l there are at most k vertices v G V, 
where n v = x. We call $ a read- once- formula if it is read 1-times. A formula = (V, E, r, X, k, B, £) 
is called k-ary if the arity (number of children) of all its vertices is at most k. If a formula is 2-ary 
we then call it binary. A function / : {0, 1}" — > {0, 1} is monotone if whenever x € {0, l} n is such 
that f{x) = 1, then for every y G {0, l} n such that x < y (coordinate-wise) we have f{y = 1) as 
well. If all the functions in B are monotone then we say that $ is (explicitly) monotone. We denote 
|$| = \X\ and call is the formula size. 

Definition 2.2 (Sub- Formula) Let <3? = (V, E, r, X, k, B) be a formula and u G V. The formula 
= (Vu, E u , «, X u , k, B), is such that V u C V, with v G V u if and only if dist(n, v ) is finite, and 
(v,w) G E u if and only if v,w G V u and (v,w) G E. X u is the set of all k v G X such that v G V u . 
If u ^ r then we call <3? u a strict sub-formula. We define |<J> n | to be the number of variables in V u , 
that is |<3? u | = \X U \. 

Definition 2.3 (assignment to and evaluation of a formula) An assignment a to a formula 
<3? = (V, E, r, X, k, B, £) is a mapping from X to £. The evaluation of $ given a, denoted ( abusing 
notation somewhat) by cr(<I>), is defined as a(r) where a : V — >■ £ is recursively defined as follows. 

• // k v G £ then o~(v) = k v . 

• // k v G X then a(v) = a{n v ). 

• Otherwise (k v G B) we set a(v) = K v (a(ui), . . . ,a(uk)), where Children(t> ) = (ui, . . . , Uk). 

Given an assignment a : X — >■ £ and u G V, we let a u denote its restriction to X u , but whenever 
there is no confusion we just use a also for the restriction (as an assignment to <£ u ). 

For Boolean formulae, we set SAT(§ = b) to be all the assignments a to $ such that o"(^>) = b. 
When 6 = 1 and we do not consider the case b = in that context, then we simply denote these 
assignments by SAT(Q). If a G SAT(&) then we say that a satisfies <I>. Let a±, o"2 be assignments to 

We define f arness$(<7i, 02) to be the relative Hamming distance between the two assignments. 
That is, f arness$(<Ti, 0-2) = \{x G X \ a\(x) / o"2(x)}|/|$|. For every subset S of assignments to 
$ we set f arness<j>(<7, S) = min{f arness$(o", a') \ a' G S}. If farness$ (a, S) > e then a is e-far 
from S and otherwise it is e-close to S. 
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We now have the ingredients to define testing of assignments to formulae in a massively 
parametrized model. Namely, the formula $ is the parameter that is known to the algorithm 
in advance and may not change, while the assignment a : X — > £ must be queried with as few 
queries as possible, and farness is measured with respect to the fraction of alterations it requires. 

Definition 2.4 [(e, g)-test] An (e, g)-test for SAT(<&) is a randomized algorithm A with free access 
to <E>, that given oracle access to an assignment a to & operates as follows. 

• A makes at most q queries to a (where on a query x £ X it receives a x as the answer). 

• If a G SAT(&), then A accepts (returns 1) with probability at least 2/3. 

• If a is e- far from SAT(<&), then A rejects (returns 0) with probability at least 2/3. Recall that 
a is e-far from SAT(Q) if its relative Hamming distance from every assignment in SAT(&) is 
at least e. 

We say that A is non-adaptive if its choice of queries is independent of their values. We say that 
A has 1-sided error if given oracle access to a £ SAT($>), it accepts (returns 1) with probability 1. 
We say that A is an (e, g)-estimator if it returns a value n such that with probability at least 2/3, 
a is both n + e-close and n — e-far from SAT(<&). 

We can now summarize the contributions of the paper in the following theorem: 

Theorem 2.5 (Main Theorem) The following theorems all hold: 

• For any read-once formula <!> where B is the set of all functions of arity at most k there exists 
a 1-sided (e, q)-test for SAT(<&) with q = exp(poly(e -1 )). 

• For any read-once formula $ where B is the set of all monotone functions of arity at most k 
there exists an (e,q) -estimator for SAT(&) with q = exp(poly(e -1 )). 

• For any read-once formula <3? where B is the set of all conjunctions and disjunctions of any 
arity there exists an (e, q)-test for SAT(&) with q = e°^ oge K 

• There exists an infinite family of 4 valued read-once formulae $ such that there is no non- 
adaptive (e, q)-test for SAT(&) with q = 0(depth(<I>)), and no adaptive (e,q)-test for SAT(Q) 
with q = 0(log(depth(<I>))). 

Note that for the first two items, the degree of the polynomial is linear in k. 
2.2 Basic formula simplification and handling 

In the following, unless stated otherwise, our formulae will all be read-once and Boolean. For our 
algorithms to work, we will need a somewhat "canonical" form of such formulae. We say that two 
formulae $ and are equivalent if cr($) = for every assignment a : X — > S. 

Definition 2.6 The mDNF (monotone disjunctive normal form) of a monotone boolean function 
f : {0, l} n — > {0, 1} is a set of terms T where each term Ti £ T is a subset Ti C [n], there exists no 
two different terms Ti,Tj £ T such that Ti C Tj, and for every x £ {0, l} n ; f(x) = 1 if and only if 
there exists a term Tj £ T such that for all i £ Tj, we have that Xi = 1. 
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Observation 2.7 Any monotone boolean function f : {0, l} n — > {0, 1} has a unique mDNF T. 

Proof. Create terms for all of the 1-witnesses of the function /, and proceed to remove all terms 
Ti <G T such that there exists a term Tj € T and Tj C Tj. One can see that this process results in 
a unique mDNF and that conversely any mDNF must result from this process. ■ 

Definition 2.8 For u € V , v £ Children(u) is called (a,b)-forceful if a(y) = a implies a(u) = b. 
v is forceful if it is (a,b)-forceful for some a,b £ {0, 1}. 

Forceful variables are variables that cause "Or-like" or "And-like" behavior in the gate. 

Definition 2.9 A vertex v GV is called unforceable if no child of v is forceful. 

Definition 2.10 (A>2>Basic formula) A read-once formula $ is fc-x-basic if it is Boolean, all 
the functions in B have arity at least 2, and are either of arity at most k and unforceable, or A or 

V of arity at least 2, and <3? satisfies the following. There is no v G V such that k v 6 {0, 1}. No A 
is a child of a A and no V is a child o/aV. Any variable may appear at most once in a leaf, either 
positively or negated. 

The set of variables that appear negated will be denoted by -*X. 

Definition 2.11 (/c-Basic formula) A read- once formula <3? is a fc-basic formula if it is k-x-basic 
and furthermore, all unforceable functions in B are also monotone. If B contains only conjunctions 
and disjunctions we abbreviate and call the formula basic. 

Lemma 2.12 Every read-once formula <I> with gates of arity at most k has an equivalent k-x-basic 
formula 

Proof. Suppose for some u that v € Children(u) is (a,b)-forceful. If b = 1 then k u can be 
replaced with an V gate, where one input of the V gate is v if a = 1 or the negation of v if a = 0, 
and the other input is the result of u when fixing a(n v ) = 1 — a. If b = then k u can be replaced 
with an A gate, where one input of the A gate is v if a = or the negation of v if a = 1, and the 
other input is the negation of the gate u when it is assumed that cf{k v ) = a. After performing this 
transformation sufficiently many times we have no forceable gates left. 

We will now eliminate -> gates. Any -> gate in the input or output of a gate which is not A or 

V can be assimilated into the gate. Otherwise, a -> on the output of an V can be replaced with an 
A with -i's on all of its inputs, according to De- Morgan's laws. Also by De-Morgan's laws, a -> on 
the output of a A can be replaced with an V with -i's on all of its inputs. 

Finally, any V gates that have V children can be merged with them, and the same goes for A 
gates. Now we have achieved an equivalent fc-x-basic formula. ■ 
Note that V and A gates are very much forceable. 

Observation 2.13 Any formula which is comprised of only monotone k-arity gates has an 
equivalent k -basic formula <£'. 
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2.3 Observations about subformulae and farness 

Definition 2.14 (heaviest child h(v)) Let $ = (V, E,r, X, k, B) be a formula. For every v G V 
we define h(v) to be v if Children(f) = 0, and otherwise to be an arbitrarily selected vertex 
u G Children(t;) ; such that \& u \ = max{|<£v| | w G Children(t;)}. 

Definition 2.15 (vertex depth depth^i;)) Let <I> = (V, E, r, X, k, B) be a formula. For every 
v we define depth <J) (f) = dist(r, v) and depth($) = max{depth (J) (n) | u G V}. 

Observation 2.16 Let v G V be such that either k v = V and b = or k v = A and 6=1, 
and f arness(er, SAT(Q V = b)) > e. For every 1 > a > there exists S C Children(t;) such that 
J2seS \®s\ > ea 2 |$| and farness (a, SAT(Q W = b)) > e(l — a) for every w G S. Furthermore, the 
exists a child u G Children(i;) such that farness (a, SAT(<& U = b)) > e. 

Proof. Let T be the maximum subset of Children(u) such that <& w is e(l — a)-far from being 
evaluated to b for every w G T. If J2teT 1^*1 < ea 2 |<^| then the distance from having $ v evaluate 
to b is at most ea 2 + e(l — a)(l — a) < e, which contradicts the assumption. 

For the last part, note that if no such child exists then <3?„ is e-close to being evaluated to b. ■ 

Observation 2.17 Let v G V be such that either k v = V and b = 1 or k v = A and b = 
0, and f arness(cr, SAT(Q V = b)) > e. For every child u G Children(i;) ; > |<E>je and 

f arness(cr, SAT(<& U = b)) > e(l + e). Furthermore, e < 1/2, and for any u G Children(v) \ {h(v)}, 
f arness(cr, SAT(<f> u = b)) > 2e. 

Proof. First suppose that the weight of some child u is less than e. In this case setting u to b 
makes the formula evaluate to b by changing less than an e fraction of inputs, a contradiction. 

Since there are at least two children, every child u is of weight at most 1 — e and since setting 
it to b would make & v evaluate to b, it is at least e(l + e)-far from being evaluated to b. 

For the last part, note that since Since |Children(t> )[ > 1, there exists u G Children(u) such 
that \&u\ < 1*^1)1/2- Thus every assignment to is 1/2-close to an assignment a' by which & v 
evaluates to b. Also note that any u G Children(i;) \ {h(v)} is of weight at most 1/2, and therefore 
if $> u were 2e-close to being evaluated to 6, <!>„ was e-close to being evaluated to b. ■ 

2.4 Heavy and Light Children in General Gates 

Definition 2.18 Given a formula <E>, a parameter e and a vertex u, we let I = £(u, e) be the smallest 
integer such that the size of the i'th largest child of u is less than \&\(4k/e)~ e if it exists, and set 
t = k + 1 otherwise. The heavy children of u are the i — 1 largest children of u, and the rest of the 
children of u are its light children. 

Lemma 2.19 // an unforceable vertex v has a child u such that 1^1(1 — e) < \<& u \> then a is both 
e-close to SAT($ V = 1) and e-close to SAT($ V = 0). 

Proof. The child is unforceful, and therefore it is possible to change the remaining children to 
obtain any output value. ■ 
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Observation 2.20 If k u ^£ A and k u G" X and a is e-far from SAT(& U = b), then it must have at 
least two heavy children. 

Proof. By the definition of £, if there is just one heavy child, then I = 2 and the total weight of 
the light children is strictly smaller than e. Therefore by Lemma [2.191 there must be more than one 
heavy child, as otherwise the gate is e-close to both and 1. ■ 

3 Upper Bound for General Bounded Arity Formula 

Algorithm[T]tests whether the input is e-close to having output 6 with 1-sided error, and also receives 
a confidence parameter 5. The explicit confidence parameter makes the inductive arguments easier 
and clearer. 

Algorithm 1 

Input: read-once fc-x-basic formula $ = (V, E, r, X, k), parameters e,6 > 0, b G {0, 1}, oracle to a. 
Output: "true" or "false". 
1: if e > 1 then return "true" 

2: if K r G X then return the truth value of o~(r) = b 

3: if K r G —>X then return the truth value of a(r) = 1 — 6 

4: if (n r = A and b = 1) or (n r = V and 6 = 0) then 

5: y i — "true" 

6: for i = 1 to I = 32(2k/e) 2k log^- 1 ) do 

7: u < — a vertex in Children(r) selected independently at random, where the probability 
that w G Children(r) is selected is 

8: y < — y A Algorithm QJ<1? U , (e(l - (2k/e)- k /16)), a, 5/2, b) 

9: end for 
10: return y 
11: end if 

12: if («; r = A and 6 = 0) or («v = V and 6=1) then 

13: if there exists a child of weight less than e then return "true" 

14: y i — "false" 

15: for all u G Children(r) do y i — y V Algorithm UX&u, + e)), a, e5/2, b) 
16: return y 
17: end if 

18: if there is a child of weight at least 1 — e then return "true" 
19: for all u G Children(r) do 

20: yl < — Algorithm (e(l + (4k/e)- k )),a,5/2k,0) 
21: y\ «— Algorithm (e(l + (4fc/e)~ fc )), a, 8/2k, 1) 
22: end for 

23: if There exists a string x G {0, l} k such that n r on x would evaluate to 6 and for all u G 
Children(r) we have y^ u equal to "true" then return "true" else return "false" 



Lemma 3.1 The depth of recursion in Algorithm{l\is at most 16(4k/e) k log(e 1 ). 



8 



Proof. If e > 1 then the condition in Line [T] is satisfied and the algorithm returns without making 
any queries. 

All recursive calls occur in Lines El [T5| 1201 and I2T1 

Since 3> is fc-x-basic, any call with a subformula whose root is labeled by A results in calls to 
subformulas, each with a root labeled either by V or an unforceable gate, and with the same b value 
(this is crucial since the b value for which A recurses with a smaller e is the b value for which V 
recurses with a bigger e, and vice- versa). Similarly, any call with a subformula whose root is labeled 
by V results in calls to subformulas, each with a root labeled either by A or an unforceable gate, 
and with the same b value. Therefore, an increase of two in the depth results in an increase of the 
farness parameter from e to at least (e(l - (2k/e)~ k /16))(e(l + (4/c/e)~ fc )) > e(l + (Ak/e)~ k /16). 
Thus in recursive calls of depth 16(Ak/e) k log(e _1 ) the farness parameter exceeds 1 and the call 
returns without making any further calls. ■ 

Lemma 3.2 Algorithm^ uses at most e - 4 80(4fc/e) fe + 3 iogiog(5 l ) q Uer { eSm 

Proof. If e > 1 then the condition in Line [T] is satisfied and no queries are made. Therefore 
assume e < 1. Observe that in a specific instantiation at most one query is used, either in Line [2] 
or Line El Therefore the number of queries is upper bounded by the number of instantiations of 
Algorithm [TJ 

In a specific instantiation at most 32(2fc/e) 2fc log((5 _1 ) recursive calls are made in total (note 
that by Line [13] there are at most 1 /e children in the case of the condition in Line [T2l and in the 
case of an unforceable gate there are at most 2k recursive calls). Recall that by Lemma 13.11 the 
depth of the recursion is at most 16(4/c/e) fc log(e _1 ). 

To conclude, we note that the value of the confidence parameter in all these calls is lower 
bounded by 8 ■ {e/2k) l ^ Ak l e ^ k ^i^ 1 ) > 5 ■ e 32 ( 4fc A) fc log^" 1 ). Therefore at most (32(4fc/e) 2fe log(<5 • 

e -32(2fc/e) fc log(fc£- 1 )^16(4fe/£) fe log(£- 1 ) = e -480(4fc/e) fc + 3 loglog^- 1 ) queries are use( J. a 

Lemma 3.3 If $ on a evaluates to b then Algorithm^ returns "true" with probability 1. 

Proof. If e > 1 then the condition of Line Q] is satisfied and "true" is returned correctly. We 
proceed with induction over the depth of the formula. If depth(<I>) = then K r G X U —>X. If 
n r G X then since <E> evaluates to b, a{r) = b, if K r E ->X then a(r) = 1 — b, and the algorithm 
returns "true" correctly. 

Now assume that depth(<!>) > 0. Obviously, for all u G Children(r), depth($) > depth(<I> u ) 
and therefore from the induction hypothesis any recursive call on a subformula that evaluates to b' 
returns "true" with probability 1. 

If K r = A and b = 1 or K r = V and 6 = 0, then it must be the case that for all u G Children(r), 
<E> U evaluates to b. By the induction hypothesis all recursive calls will return "true" and y will get 
the value "true", which will be returned by the algorithm. 

Now assume that k t = A and b = or K r = V and b = 1. Since $ evaluates to b then it must be 
the case that at least for one u G Children(r), $ u evaluates to b. By the induction hypothesis, the 
recursive call on that u will return "true", and y will get the value "true" which will be returned 
by the algorithm (unless the algorithm already returned "true" for another reason). 

Lastly, assume that r is an unforceable gate. Since $ evaluates to b, the children of r evaluate to 
an assignment x to K r which evaluates to b. By the induction hypothesis, for every u G Children(r) 
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the recursive call on & u with x u will return "true", and thus the assignment x will, in particular, 
fill the condition in Line [23] and the algorithm will return "true". ■ 

Lemma 3.4 If a is e-far from getting $ to output b then Algorithm^ returns "false" with probability 
at least 1 — 5. 

Proof. The proof is by induction over the tree structure, where we partition to cases according 
to K r and b. Note that e < 1 

If K r € X or K r € —>X then by Lines [5] or the algorithm returns "false" if a is 0-far from 
getting to output b. 

If K r = A and b = 1 or n r = V and b = 0, since a is e-far from getting $ to output b then by Ob- 
servation [2TTB] we get that there exists T C Children(r) such that J2teT \®t\ > |$|e((2/c/e)~ 2fc /16) 
and each § t is e(l — (2/c/e) _fc /16)-far from being evaluated to b. Let 5 be the set of all vertices se- 
lected in Line [71 The probability of a vertex from T being selected is at least e((2/c/e) _2A: /16). bmce 
this happens at least 2>2{2k/e) 2k \og{5~ 1 ) times independently, with probability at least 1 — 5/2 we 
have that SflT ^ 0. Letting w G TdS, the recursive call on it with parameter e(l — (2fc/e) _fc /16) 
will return "false" with probability at least 1 — 5/2, which will evetually cause the returned value 
to be "false" as required. Thus the algorithm succeeds with probability at least 1 — 5. 

Now assume that n r = A and b = or n r = V and 6 = 1. Since $ is e-far from being evaluated 
to b, Observation 12. 171 implies that all children are of weight at least e, and therefore the conditions 
of Line 1131 would not be triggered. Every recursive call on a vertex v € Children(r) is made with 
distance parameter e(l + e) and so it returns "true" with probability at most eS/2. Since there are 
at most eT 1 children of r, the probability that none returns "true" is at least 1 — 5/2 and in that 
case the algorithm returns "false" successfully. 

Now assume that K r is some unforceable gate. By Observation 12.191 since $ is e-far from being 
satisfied the condition in Line [TH] is not triggered. If the algorithm returned "true" then it must 
be that the condition in Line [23] is satisfied. If there exists some heavy u E Children(r) such 
that y h u is "true" and y}j~ b is "false", then by Lemma 13.31 the formula <I> U does evaluate to b and 
the string in x must be such that x u = b. For the rest of the children of r, assuming the calls 
succeeded, each the subformula rooted in v is (e(l + (4/c/e)~ fc ))-close to evaluate to x v . Since u 
is heavy, the total weight of Children(r) \ {u} is at most 1 — (4k/e)~ k , and thus by changing at 
most a (e(l + (4A;/e)~ fc ))(l — (4/c/e)~ fc ) < e fraction of inputs we can get to an assignment where $ 
evaluates to b. 

If all heavy children u are such that both y h u and y]^ b are "true", then pick some heavy child u 
arbitrarily. Since r is unforceable, there is an assignment that evaluates to b no matter what the 
value of $ u is. Take such an assignment x that fits the real value of <3? n . Note that for every heavy 
child v we have that y% v is "true", and therefore by changing at most an (e(l + (4/c/e) _fc ))-fraction 
of the variables in we can get it to evaluate to x v . The weight of u is at least {4k/e)- £+1 , thus 
the total weight of the other heavy children is at most 1 — (4k/e)~ e+1 and the total weight of the 
light children is at most |(4/c/e)~^. So by changing all subformulas to evaluate to the value implied 
by x we change at most an (e(l + (4k/e)~ k ))(l — (4/c/e)~^ +1 ) + |(4/c/e) - ^ < e fraction of inputs 
and get an assignment where $ evaluates to b. Note that this x is not necessarily the one found in 
Line [M 
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Thus we have found that finding an assignment x in Line |2"51 assuming the calls are correct, 
implies that is e-close to evaluate to b. The probability that all relevant calls to an assignment 
return "true" incorrectly is at most the probability that the 2k recursive calls err, which by the 
union bound is at most 8, and the algorithm will return "false" correctly with probability at least 
1-8. m 

4 Estimator for monotone formula of bounded arity 

Algorithm [2] operates in a recursive manner, and estimates the distance to satisfying the formula 
rooted in r according to estimates for the subformula rooted in every children of r. The algorithm 
explicitly receives a confidence parameter 8 as well as the approximation parameter e, and should 
with probability at least 1 — 8 return a number rj such that the input is both (rj + e)-close and 
(r/ — e)-far from satisfying the given formula. The explicit confidence parameter makes the inductive 
arguments easier and clearer. 

Algorithm 2 

Input: read-once /c-basic formula = (V, E, r, X, k), parameters e, 8 > 0, oracle to a . 
Output: rj G [0, 1]. 

1: if K r G X then return 1 — a(n r ) 

2: if e > 1 then return 

3: if K r = V and there exists u G Children(r) with |<J> U | < e|<J>| then return 
4: if K r = A then 

5: for i = 1 to I = [1000e- 2fc - 2 (4A;) 2fe • log(l/<5)] do 

6: u < — a vertex in Children(r) selected independently at random, where the probability 

that w G Children(r) is selected is |3>,u|/|<I>| 
7: on i — Algorithm 02^, e(l - (4k/e)- k /8), 5e(4k/e)- k /16, a) 
8: end for 
9: return J2i=i a i/l 
10: else if k t ^ A then 

11: for every light child u of r set a u < — 

12: for every heavy child u of r set a u < — Algorithm [2]^<I> U , e(l + (4k/e)~ k ), 8/ max{/c, 1/e}, a) 
13: for every term C in the mDNF of K r set ac < — J2ueC a « ' ~WT 
14: return minlac 1 : C G mDNF(«: r )} 
15: end if 



The following states that Algorithm [2] indeed gives an estimation of the distance. While esti- 
mation algorithms cannot have 1-sided error, there is an additional feature of this algorithm that 
makes it also useful as a 1-sided test (by running it and accepting if it returns r] = 0). 

Theorem 4.1 With probability at least 1 — 8, the output of Algorithmic^, e, 8, a) is an rj such that 
the assignment a is both (r] + e)-close to satisfying $ and (rj — e)- far from satisfying it. Additionaly, 
if the assignment a satisfies then rj = with probability 1. Its query complexity (treating k and 
8 as constant) is always 0(exp(poZy(e -1 ))). 
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Proof. The bound on the number of queries is a direct result of Lemma 14.31 below. Given that, 
the correctness proof is done by induction on the height of the formula. The base case (for any e 
and 8) is the observation that an instantiation of the algorithm that makes no recursive calls (i.e. 
triggers the condition in Line [T] or [2]) always gives a value that satisfies the assertion. 

The induction step uses Lemma 14.41 and Lemma 14.51 below. Given that the algorithm performs 
correctly (for any e and 8) for every formula <J?' of height smaller than <I>, the assertions of the 
lemma corresponding to K r (out of the two) are satisfied, and so the correctness for $ itself follows. 
■ 

The dependency on 8 can be made into a simple logarithm by a standard amplification technique: 
Algorithm [2] is run 0(1/5) independent times, each with a confidence parameter 2/3, and then the 
median of the outputs is taken. 

Lemma 4.2 When called with <3? ; e, 8, and oracle access to a, Algorithm goes down at most 
2(4k/e) k log(l/e) = poly(e) recursion levels. In those recursion levels, 8 decreases by a factor of at 
most (e(4A;/e)-Vl6) 2(4fe/e)&los(1/e) = exp(poly(l/e)). 

Proof. Recursion can only happen on Line [7| and Line[T2J Moreover, because of the formula being 
fc-basic, recursion cannot follow through Line [7| two recursion levels in a row. ■ 

Lemma 4.3 When called with <£, e, 8, and oracle access to a, Algorithm^ uses a total of at most 
exp(poly(l/e)) queries. 

Proof. Denoting by 8' the smallest 8 in a recursive call, we have 8' > 8(e(4k/e)~ k /16) 2 ( 4fc//e ) fc lo s(V e ) 
by Lemma 14.21 The number of recursive calls per instantiation of the algorithm is thus at most 
/' = [~1000e _2fc_2 (4A;) 2fc • log(l/<5')] = poly(l/e). As the algorithm may make at most one query 
per instantiation, and this only in the case where a recursive call is not performed, the total 
number of queries is (bounding the recursion depth through Lemma f4.2p at most (Z') 2 ( 4fc / e ) lo s( 1 / e ) = 
exp(poly(l/e)). ■ 

Lemma 4.4 If K r ^ A and all recursive calls satisfy the assertion of Theorem \4-l\ then with 
probability at least 1—8 the current instantiation of Algorithm^ provides a value rj such that a is 
both (rj + e) -close to satisfying <I> and (rj — e) -far from satisfying it. Furthermore, if a satisfies $ 
then with probability 1 the output is rj = 0. 

Proof. First we note that Step El if triggered, gives a correct value for rj (as the a can be made 
into a satisfying assignment by changing possibly all variables of the smallest child of r). We also 
note that if K r = V and Step [3] was not triggered, then by definition all of r's children are heavy, 
and there are no more than 1/e of them. 

The true farness of a from $ is the minimum over all terms C in K r of the adjusted cost of making 
all children of C evaluate to 1, which is Xmec farness {a, SAT($ U )) ■ J^jl. Now in this case there 
are clearly no more than max{£;, e -1 } children, and so by the union bound, with probability at least 
1 — 8, every call done through Line [7] gave a value rj u so that indeed a is (r ?u + e(l + (4A:/e)- /c ))-close 
and (rj u - e(l + (4fc/e)~ fc ))-far from $ u . 

Now let Di denote Cj minus any light children that it may contain. It may be that some D^s 
contain all heavy children, but as there are no forcing children (and there are heavy children) 
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it must be the case that some D^s do not contain all heavy children, and in Line Q31 these will 
dominate. Note that J2ueD- \*&u\ < (1 — (4/c/e) 1 )|<I>| for any Di not containing a heavy child. This 
implies by bounding (1 + (Ak/e)~ k )) ■ (1 - (Ak/e) 1 ' 1 ): 

^farness(cj,^r($ u ))-l^-e < ^ D ^ u ^ < ^ f arness(a, SAT($ u ))-^+e-2£;(4£;/e)- £ 

Now the true farness of Cj not containing all heavy children is at least that of Di , and at most 
that of Di plus with the added farness of making all light children evaluate to 1, which is bounded 
by k(Ak/e)~ i . This means that for such a Cj we have: 

]Tfarness(a,&ir($ M ))-^-e < ^ D ^ u ^ < £ f a rness(a, SylT($ u ))-^+e-/t(4£:/e)- £ 
ueCi II I I ueCi I I 

As the value returned as r\ is the minimum over terms Cj in k t of • ~^pgj • — ' • We also know 
that this minimum is reached by some Cj which does not contain all heavy children, but it may 
be that in fact f arness(<r, SAT($>)) = J2 ueC rfarness (cr, SAT(^ U )) ■ for some i ^ j (the true 
farness is the minimum of the total farness of each clause, but it may be reached by a different 
clause) . 

By our assumptions 

f arness(cr, SAT($))-e = arness(fj, SAT(^ u ))-^--e < arness(a, S' J 4T($ u ))~-f < n 

so we have one side of the required bound. For the other side, we split into cases. If Cj also 
does not contain all heavy children then we use the way we calculated rj as the minimum over the 
corresponding sums: 

V = ^ |^j < f arness(cr, SAT{<$)) + e 

In the final case, we note that by the assumptions on the light children we will always have (recalling 
that Ci will in particular have all heavy children of Cj): 

r,= j^ggiZpj^j < Y,i^ness(a,SAT(^ u ))-^-+e-k(ik/e)- e < £f arness(a, SAT(<5> u ))-^+e 
where the rightmost term equals f arness(er, SAT(&)) + e as required. 

For the last part of the claim, note that if a satisfies then in particular, one of the terms 
C of K r must be satisfied. By the induction hypothesis, for all u £ C we would have a u = and 
therefore ac = 0, and since a is taken as a minimum over all terms we would have a = 0. ■ 

Lemma 4.5 If K r = A and all recursive calls satisfy the assertion of Theorem \4-l\ then with 
probability at least 1 — 5 the current instantiation of Algorithm provides a value n such that a 
is both (j) + e)-close to satisfying & and (rj — e) -far from satisfying it. If a satisfies <3? then with 
probability 1 the output is n = 0. 
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Proof. First note that if we sample a vertex w according to the distribution of Line [5] and then 
take the true farness f arness(cr, SAT($ W )), then the expectation (but not the value) of this equals 
f arness(o", SAT(§>)). This is because to make a evaluate to 1 at the root, we need to make all its 
children evaluate to 1, an operation whose adjusted cost is given by the weighted sum of farnesses 
that corresponds to the expectation above. 

Thus, denoting by X{ the random variable whose value is farness (<r, SAT($ Wi )) where Wi is 
the vertex picked in the ith iteration, we have EpQ] = f arness(<r, SAT(&)). By a Chernoff type 
bound, with probability at least 1 — 5/2, the average X of X\, . .. ,Xi is no more than e k+1 (Ak)~ k / '16 
away from E[Xj] and hence satisfies: 

f arness(tj, SAT($)) - e k+l (Ak)~ k /16 < X < f arness(CT, SAT($)) + e k+1 (4k)~ k /16 

Then note that by the Markov inequality, the assertion of the lemma means that with probability 
at least 1 — 5/2, all calls done in Line H2] but at most e(4k/e)~ k /16 of them return a value r\ w so 
that a is (r/ w + e(l - (4A;/e) _fc /8))-close and (rj w - e(l - (4A:/e)~ fc /8))-far from $ w . 

When this happens, at least (1 — e(4k / e)~ k / 16) of the answers Oj of the calls are up to e(l — 
(4k/e)~ k /16)) away from each corresponding Xi, and at most e(4k/e)~ k /16 of the answers a% are 
up to 1 away from each corresponding X^. Summing up these deviations, the final answer average 
r\ satisfies 

X - e(l - (4fc/e)- fc /4) - e(4A;/e)- fc /16 < V < X + e(l - {4k/e)- k /A) + e{4k/e)- k /16 

With probability at least 1 — 5 both of the above events occur, and summing up the two 
inequalities we obtain the required bound 

f arness(cr, SAT(<$>)) - e < tj < f arness(a, SAT($)) + e 

■ 

5 Quasi-polynomial Upper Bound for Basic-Formulae 

Let = (V, E, r, X, n, B) be a basic formula and a an assignment to <£. 

The main idea of the algorithm is to randomly choose a full root to leaf path, and recurs over all 
the children of "V" vertices on this path that go outside of it, if they are not too many. The main 
technical part is in proving that if a is indeed e-far from satisfying then many of these paths 
have few such children (few enough to recurs over all of them) , where additionally the distance of a 
from satisfying the corresponding sub-formulae is significantly larger. An interesting combinatorial 
corollary of this is that formulae, for which there are not a lot of leaves whose corresponding paths 
have few such children, do not admit e-far assignments at all. 

5.1 Critical and Important 

To understand the intuition behind the following definitions, it is useful to first consider what 
happens if we could locate a vertex that is "(e, cr)-critical" in the sense that is defined next. 



14 



Definition 5.1 [ (e, a) -important, (e, a) -critical ] A vertex v E V is (e, cr)-important if a ^ 
SAT($>), and for every u that is either v or an ancestor of v, we have that 

• farness(<7,&471($ u )) > (2e/3)(l + 2e/3) [depth. (u)/3j 

• If K u = y and u ^ v then h(u) is either v or an ancestor of v. 

An (e, a) -critical vertex v is an (e, a) -important vertex v for which k v € X. 

Note that such a vertex is never too deep, since f arness(cr, SAT($> U )) is always at most 1. The 
following observation follows from Definition 15.11 

Observation 5.2 If v is (e, a) -important, then depth^t;) < 4e _1 log (2e ). 

A hypothetical oracle that provides a critical vertex can be used as follows. If v is the vertex 
returned by such an oracle, then for every ancestor u of v, such that k u = V, and every w € 
Children(u) that is not an ancestor of v, a number of recursive calls with $> w and distance parameter 
significantly larger than e are used. The following Lemma implies that if for each of these vertices 
one of the recursive calls returned 0, then we know that a SAT(&). 

Lemma 5.3 The set of special relatives of v € V is the set T of every u that is not an ancestor 
of v or v itself but is a child of an ancestor w of v, where k w = V. If a £ SAT(& U ) for every 
ueTU {v}, then a g" SAT($). 

Proof. If depth^u) = then a SAT($ V ) implies a SAT($>). Assume by induction that the 
lemma holds for any formula $>' = (V', E', r' , X' , k'), assignment a' to $' and vertex u £ V such 
that < depths (u) < depths (v). Let w be the parent of v. Observe that the special relatives of 
w are a subset of the special relatives of v and hence by the induction assumption we only need to 
prove that a g" SAT($ W ) in order to infer that a g" SAT($). 

If k w = A, then a SAT($ V ) implies that a g" SAT($ W ). If k w = V, then a SAT($ V ) and 
a SAT(® U ) for every u G T implies that a SAT($ W ), since Children^) \ {v} C T. ■ 

The following Lemma state that if a is e-far from SAT(<&), then (e, <r)-critical vertices are 
abundant, and so we can locate one of them by merely sampling a sufficiently large (linear in 1/e) 
number of vertices. 

The main part of the proof that it holds is in showing that if a is only 2e/3-far from SAT(<&), 
then there exists an (e, cr)-critical vertex for a. We first show that this is sufficient to show the 
claimed abundance of (e, cr)-critical vertices, and then state and prove the required lemma. 

Lemma 5.4 If a is e-far from SAT($>), then \{v\v is (e, a) -critical}] > e|<J>|/4. 

Proof. Set Critical ejCr = {v\v is (e, cr)-critical} and assume on the contrary that |Critical e a \ < 
e|<J>|/4. Set a' to be an assignment to X so that for every s £ V where k s £ I, we have that 
ct'(k s ) = 1 if k s G Critical £jCr and otherwise a'(x) = a(x). Thus Critical,^/ = 0. By the triangle 
inequality we have that f arness(cr, SAT($>)) — f arness(o"', SAT($>)) < f arness(er', a). Finally 
since Critical,^/ = 0, Lemma f5.5l which we prove below, asserts that f arness(u / , SAT(&)) < 2e/3 
and we reach a contradiction. ■ 
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Lemma 5.5 If there is no (e, a) -critical vertex, then a is 2e/3-close to SAT(<&). 

Proof. We shall show that if a is 2e/3-far from SAT(<&), then there exists an (e, cr)-critical vertex. 
Assume that a is 2e/3-far from SAT(<&). This implies that r is an (e, cr)-important vertex. Hence an 
(e, cr)-important vertex exists. Let v be an (e, unimportant vertex such that depths (v) is maximal. 
Consequently, none of the vertices in Children(-u) is (e, cr)-important. We next prove that v is 
(e, cr)-critical. 

Assume on the contrary that v is not (e, <r)-critical. Consequently k v X and hence to get a con- 
tradiction it is sufficient to show that there exists an (e, cr)-important vertex in Children(-u). If k v 
v. then by Observation EH] we get that farness(a, SAT($ h{v) )) > (2e/3)(l+2e/3) Ldepth # (h(«))/3J ; 
and hence h(v) is (e, cr)-important. 

Assume that n v = A. Let u be such that farness(<7, SAT( > f arness(cr, SAT(& v )). Ob- 
servation [2TT2] asserts that such a vertex exists. We assume that depths (u) > 2, since otherwise 
it cannot be the case that f arness(<7, SAT(Q U )) < (2e/3)(l + 2e/3)°. Let w E V be the par- 
ent of v. Since w is an ancestor of v it is (e, cr)-important, and hence farness(<7, SAT( $ w )) > 
(2e/3)(l + 2e/3)L de P th *( u ')/ 3 J. Since $ is basic we have that k w = V. Thus by Observa- 
tion ETT] we get that f arnessO, SAT(§ V )) > (2e/3)(l + 2e/3) 1+ Ldepth^ (^)/3j _ Finally since 
farness(o", SAT( > farness(cr, SAT{ and depth^u) = depth^u;) + 2 we get that 
farness(a, SAT{$ U )) > (2e/3)(l + 2e/3) Ldepth $ ( u )/3j _ 

■ 

5.2 Algorithm 

This algorithm detects far inputs with probability 0(e), but this can be amplified to 2/3 using 
iterated applications. 

Algorithm 3 

Input: read-once basic formula <!> = (V, E, r, X, k), a parameter e > 0, oracle to a . 
Output: z € {0,1}. 

1: if e > 1 then return 1 

2: if K r £ X then return a(n r ) 

3: Pick s uniformly at random from all v such that k v € X 

4: A i — all ancestors v of s such that k v = V 

5: R < — (UtieA Children(v)) \ {w \ w is an ancestor of s} 

6: if \R\ > 3e- 2 log(2e -1 ) then return 1 

7: for all u G R do 

8: y u i — 1 

9: for i = 1 to [206- 1 log e" 1 ] do y u i — y u A Algorithm [3J<I> U , a, 4e/3) 
10: end for 

11: return ct(k s ) V \f ueR y u 



We can now proceed to prove the correctness of Algorithm [3l Algorithm [3] is clearly non- 
adaptive. We next prove that it always returns "1" for an assignment that satisfies the formula, 
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and returns "0" with probability linear in e for an assignment that is e-far from satisfying the 
formula. Using 0(1/ e) independent iterations amplifies the later probability to 2/3. 

Lemma 5.6 For e > 0, Algorithm^ halts after using at most e - 16 + 161 °g e queries, when called 
with <!?, e and oracle access to a. 

Proof. The proof is formulated as an inductive argument over the value of the (real) farness 
parameter e. However, it is formulated in a way that it can be viewed as an inductive argument 
over the integer valued [~log(ae _1 )~| , for an appropriate global constant a. 

If e > 1, then the condition in Line [U is satisfied, and there are no queries or recursive calls. 
Hence we assume that e < 1. Observe that in a specific instantiation at most one query is used, 
since a query is only made on Line [2] or on Line [TT1 and always as part of a "return" command. 
Hence the number of queries is upper bounded by the number of calls to Algorithm [3] (initial and 
recursive). We shall show that the number of these calls is at most e ~ 16 + 161o s e . 

Assume by induction that for some rj < 1, for every rj < rf < 1, every formula $' and assignment 
a' to on call to Algorithm with <J?', rf and an oracle to a', at most ^'~ 16+161og '' calls to 
Algorithm are made (including recursive ones). 

Assume that e > 3r//4. If K r E X, then the condition on Line [His satisfied and hence there are 
no recursive calls. Thus Algorithm [3] is called only once and 1 < e - 16 + 161o g e . 

Assume that K r X. Note that every recursive call is done by Lineal By Line [7] and LineO 
at most \R\ ■ [20e _1 log e -1 ] recursive calls are done. The condition on Line [6] ensures that \R\ ■ 
[20e _1 log e _1 ] < 3e~ 2 log (2e _1 ) • [~20e _1 loge -1 ] . According to Line M each one of these recursive 
calls is done with distance parameter 4e/3 > rj. Thus by the induction assumption the number of 
calls to Algorithm is at most 1 + 3e~ 2 log (2 e - 1 ) • ^Oe" 1 log e" 1 ] • (4e/3)~ 16+161og(4e/3) . This ig 
less than e -i6+i6lo ge _ B 

The following theorem will be immediate from Lemma 15.61 above when coupled with Lemma 
15.91 and Lemma 15.111 below. 

Theorem 5.7 Let e > 0. When Algorithm^ is called with <!>, e and an oracle to a, it uses at most 
e -i6+i6ioge queries; if a € SAT($>) then it always returns 1, and if a is e-far from SAT(Q) then it 
returns with probability at least e/8. 

Theorem 15.71 does not imply that Algorithm is an e-test for SA T($). However it does imply 
that in order to get an e-test for SAT(&) it is sufficient to do the following. Call Algorithm [3] 
repeatedly [~20e -1 loge -1 ] times, return if any of the calls returned 0, and otherwise return 1. 
This only increases the query complexity to the value in the following corollary. 

Corollary 5.8 There exists an e-test for <I>, that uses at most e - 20 + 161 °g (E queries. 

Lemma 5.9 Let e > and a G SAT(<&). Algorithm^ returns 1 when called with <I>, e and an 
oracle to a. 

Proof. To prove the lemma we shall show that if Algorithm [3] returns 0, when called with <3?, e 
and oracle access to a, then a £ SAT($>). If depth(<I>) = then the condition in Line [1] is satisfied 
and cr(«v) is returned. Hence a(K r ) = and therefore a SAT(<&). Assume that for every e' > 0, 
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$' where depth( < I> / ) < depth(<I>), and assignment a' to if Algorithm returns 0, when called 
with e' and oracle access to a', then a' G" 5v4T($). 

Observe that the only other way a can be returned is through Line 1TT| if it is reached. Let R 
be the set of vertices on which there was a recursive call in Line and k s the variable whose value 
is queried on Line II 1L According to Line HU a is returned if and only if <t(k s ) = and for every 
u G R there was at least one recursive call with <& u and distance parameter 4e/3 that returned a 
0. By the induction assumption this implies that a G" SAT{Q U ) for every u G R. Note that the 
set R satisfies the exact same conditions that the set T satisfies in Lemma 15.31 Hence, Lemma 15.31 
asserts that a G" SAT(<$>). ■ 

We now turn to proving soundness. This depends on first noting that the algorithm will indeed 
check the paths leading to critical vertices. 

Observation 5.10 // the vertex s picked in Line\Q is (e, a) -critical, then it will not trigger the 
condition of Line 

Proof. Definition 15.11 in particular implies that for every u G A (as per Line H]) we have 
|Children(u)| < (3/2e)(l + 2e/3)"L de P th $(«)/ 3 J < 3/2e, as otherwise a will be too close to 
satisfying <& v . Also, from Observation 15.21 we know that depth^s) < 4e _1 log (2e~ x ) and so 
\A\ < 2e- 1 log(2e- 1 ) + 1. 

The two together give us the bound \R\ < (3/2e - l)(2e _1 log (2c" 1 ) + 1) < 3e~ 2 log(2e _1 ), and 
so the condition in Line [3] is not triggered. ■ 

Lemma 5.11 Let a he e-far from SAT(&). If Algorithm^ is called with e, <3? and an oracle to a, 
then it returns with probability at least e/8. 

Proof. The base case, K r G X, is handled correctly by Line [TJ Assume next that e > 3/4. 
Assume that the vertex s selected in Line [3] is (e, cr)-critical. Hence by definition a is more than 
1/2-far from SAT(& U ) for every ancestor u of s. Thus by Observation 12.171 we have that k u = A 
for every ancestor u of s. Consequently, by Line [2] and Line [TT] the value returned will be cr(n s ), 
and o~(k. s ) = because s is (e, cr)-critical. By Lemma 15.41 with probability at least 3/16 the vertex 
s selected in Line [3] is (e, (j)-critical. Thus, is returned with probability at least 3/16, which is 
greater than e/8 when 3/4 < e < 1. 

For all other e we proceed by induction over the depth. Assume that for any formula $' such 
that depth( < I> / ) < depth(<I>) and any assignment a' to that is 77-far from SAT(&') (for any 77), 
Algorithm [3] returns with probability at least n/8. Given this we prove that is returned with 
probability at least e/8 for and a. 

Assume first that the vertex s selected in Line[3]is (e, cx)-critical. Let A, R be the sets from Line[5] 
and Lined! Since s is (e, <r)-critical, by definition for every u € A we have that a is 2e/3-far from 
SAT(& U ). Also, because s is (e, cr)-critical, by definition for every u G A and w G Children(ti)ni? we 
have that w 7^ h(u), and therefore by Observation 12 . 1 71 we have that a is 4e/3-far from SAT(& W ) for 
every w G R. By the induction assumption, for every w G R, with probability at least 1 — (4e/3)/8 
Algorithm [3] returns when called with 4e/3, & w and an oracle to a. Hence, for every w G R, the 
probability that on [20e _1 log e _1 ] such independent calls to Algorithm [3] the value was never 
returned is at most (1 - (4e/3)/8) r20e ~ 1 loge_1 \ This is less than (e~ 2 log (2e _1 ))/6. Observation 
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15. 1UI ensures that \R\ < 3e~ 2 log (2e _1 ), and in particular the condition in LineOis not invoked and 
the calls in Line [9] indeed take place. By the union bound over the vertices of R, with probability 
at least 1/2, for every u € R at least one of calls to Algorithm [3] with 4e/3, § u and an oracle to a 
returned the value 0. This means that for every u € R, y u in Line was set to and remained 0. 
Consequently this is the value returned in Line fTTI 

Finally, since a is e-far from SAT(&), by Lemma 15.41 the vertex s selected in Line [3] is (e, <r)- 
critical with probability at least e/4. Therefore is returned with probability at least e/8. ■ 

6 The Computational Complexity of the Testers and Estimator 

There are two parts to analyzing the computational complexity of a test for a massively parametrized 
property. The first part is the running time of the preprocessing phase, which reads the entire pa- 
rameter part of the input, in our case the formula, but has no access yet to the tested part, in our 
case the assignment. This part is subject to traditional running time and working space definitions, 
and ideally should have a running time that is quasi-linear or at least polynomial in the parameter 
size. 

In our case, the preprocessing part would need to take a k-ary formula and convert it to the 
basic form corresponding to the algorithm that we run. We would also need to put the formula in 
a data structure that allows the following operations to be performed as quickly as possible: 

For Algorithm [H we would need to quickly pick a child of a vertex with probability proportional 
to its sub-formula size, and know who are the light children as well as what is the relative size of 
the smallest child. This mainly requires storing the size of every sub-formula for every vertex of 
the tree, as well as sorting the children of each vertex by their sizes and storing the value of the 
corresponding Algorithm [2] requires very much the same additional data as Algorithm [TJ 

For Algorithm [31 we would need to navigate the tree both downwards and upwards (for finding 
the ancestors of a vertex), as well as the ability to pick a vertex corresponding to a variable at 
random, which in itself does not require special preprocessing. 

It is not hard to see that both constructing the normal form and calculating the above additional 
data can be done very efficiently. Furthermore, the only part that depends on epsilon is designating 
the light children, but this can also be done "for all epsilon" at a low cost (by storing the range of 
e for every positive £). 

The second part is analyzing the running time complexity of the algorithm. Once the above 
preprocessing is performed, the time per instantiation (and thus per query) of the algorithm will 
be very small (where we charge the time it takes to calculate a recursive call to the recursive 
instantiation). This would make it a cost logarithmic in the input size per query (multiplied by 
the time it takes to write and read an address) - where the log incurrence is in fact only when we 
need to randomly choose a child according to its weight. 

7 An Untestable 4- Valued Formula 

We describe here a read-once formula over an alphabet with 4 values, defining a property that 
cannot be 1/4-tested using a constant number of queries. The formula will have a very simple 
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structure, with only one gate type. However this gate satisfies no monotonicity-like property with 
respect to the alphabet. 

7.1 The formula 

For convenience we denote our alphabet by £ = {0, 1,P, P}. An input is said to be accepted by 
the formula if, after performing the calculations in the gates, the value received at the root of 
the tree is not "F". We restrict the input variables to {0, 1}, although it is easy to see that the 
following argument holds also if we allow other values to the input variables (and also if we change 
the acceptance condition to the value at the root having to be "P"). 

Definition 7.1 The balancing gate is the gate that receives two inputs from £ and outputs the 
following. 

• For (0,0) the output is and for (1, 1) the output is 1. 

• For (1,0) and (0, 1) the output is P. 

• For (P, P) the output is P , 

• For anything else the output is F . 

For a fixed h > 0, the balancing formula of height h is the formula defined by the following. 

• The tree is the full balanced binary tree of height h, and hence there are 2 h variables. 

• All gates are set to the balancing gate. 

• The formula accepts if the value output at the root is not "F ". 

We denote the variables of the formula in their order by xq, . . . , x 2 h_i- The following is easy. 

Lemma 7.2 An assignment ao € {0, 1}, . . . , a 2 /t_i € {0,1} to xq, . . . , x 2 h_\ accepted by the 
formula if and only if for every < k < h and every < i < 2 h ~ k , the number of 1 values in 
a i2 k , . . . , a(j+i)2fc-i is either 0, 2 k or 2 k ~ 1 . 

Proof. Denote the number of 1 values in variables descending from a gate u by tti( M )- Let us 
prove by induction on k that #i(v) = if and only if the value of v is 0, tti( w ) = 2 fc if and only if 
the value of v is 1, and #i(v) = 2 fc_1 if and only if the value of v is P. For k = 1 we have the two 
inputs of v, and by the definition of the balancing gate the claim follows. 

For k > 1, we have 2 k variables which are all descendants of the same gate v. By the induction 
hypothesis, for both children of v, denoted u, w, we have that Ui(it),|ji(«;) G {0, 2 k ~ 2 , 2 k ~ 1 } and 
that this determines their value. If #i(w) = tJi(«) = then they both evaluate to and so does 
v. Similarly §i(w) = Ui(u) = 2 k ~ l then both evaluate to 1 and so does v. If tti(w) = 2 k ~ 1 and 
JJi (t/j) = 0, then u evaluates to 1 and w to and indeed v evaluates to P (similarly for the symmetric 
case). The remaining case is tti(^) £ {0,2 fc_1 } and tti(^) = 2 k ~ 2 (and the symmetric case), by the 
induction hypothesis and the definition of the balancing gate this implies that v evaluates to F and 
the formula is unsatisfied. ■ 

In other words, every "binary search interval" is either all 0, or all 1, or has the same number of 
and 1. This will allow us to easily prove that certain inputs are far from satisfying the property. 
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7.2 Two distributions 

We now define two distributions, one over satisfying inputs and the other over far inputs. 

Definition 7.3 The distribution Dy is defined by the following process. 

• Uniformly pick 2 < k < h. 

• For every < i < 2 h ~ k , independently pick either (yifi,Ui,i) = (0,1) or (yi : o,Vi,i) = (1,0) 
(each with probability 1/2). 

• For every < i < 2 h ~ k , set 

x i2 k = ■ ■ ■ = X i2 k +2 k-i_i = yifl] x i2 k +2 k - 1 = • • • = £(j+l)2 fc -l = Vi,l- 

Definition 7.4 The distribution Dn is defined by the following process. 

• Uniformly pick 2 < k < h. 

• For every < i < 2 h ~ k , independently choose (zi : o, Zj i, Zi 2, ^,3) to have either one 1 and 
three or one and three 1 (each of the 8 possibilities with probability 1/8). 

• For every < i < 2 h ~ k , set 

x i2 k = • ■ ■ = X i 2 k +2 k - 2 -l = z i,0i X i 2 k +2 k ~ 2 = ' ' ' = x i2 k +2 k - 1 -1 = z i,l'i 

x i2 k +2 k - 1 = • • • = X i2 k +2 k-i + 2 k -' 2 -l = z i,2] x i2 k +2 fc - J +2 k - 2 = ' ' " = x (i+l)2 k -l = z i,3- 

It is easier to illustrate this by considering the calculation that results from the distributions. 
In both distributions we can think of a randomly selected level k (counted from the bottom, where 
the leaf level and the level above it are never selected). In Dy, the output of all gates at or 
above level k is "P", while the inputs to every gate at level k will be either (0, 1) or (1,0), chosen 
uniformly at random. 

In Dn all gates at level k will output U F" (note however that we cannot query a gate output 
directly); looking two levels below, every gate as above holds the result of a quadruple chosen 
uniformly from the 8 choices described in the definition of Dn (the quadruple (z^o, z^i, 2^2, ^,3)). 
At level k — 2 or lower the gate outputs are and 1 and their distribution resembles very much the 
distribution as in the case for Dy - as long as we can not "focus" at the transition level k. This is 
formalized in terms of lowest common ancestors below. 

Lemma 7.5 Let Q C {1, . . . , 2 h } be a set of queries, and let H C {0, . . . , h} be the set of levels 
containing lowest common ancestors of subsets of Q. Conditioned on neither k nor k — 1 being in 
H, both Dy and Dn induce exactly the same distribution over the outcome of querying Q. 

Proof. Let us condition the two distributions on a specific value of k satisfying the above. For 
two queries q,q' <G Q whose lowest common ancestor is on a level below k — 1, with probability 1 
they will receive the exact same value (this holds for both and Dy). The reason is clear from 
the construction - their values will come from the same j/jj or Zij. 
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Now let Q' contain one representative from every set of queries in Q that must receive the same 
value by the above argument. For any q, q' E Q', their lowest common ancestor is on a level above 
k. For Dy it means that x q takes its value from some yij and x q i takes its value from some j/ 
where i ^ i! . Because each pair (2/1,0,2/1,1) is chosen independently from all others, this means that 
the outcome of the queries in Q' is uniformly distributed among the 2l Q 'l possibilities. The same 
argument (with z^j and z^j' instead of yij and 2/i'j') holds for D^. Hence the distribution of 
outcomes over Q' is the same for both distributions, and by extension this holds over Q. ■ 

On the other hand, the two distributions are much different with respect to satisfying the 
formula. 

Lemma 7.6 An input chosen according to Dy always satisfies the balancing formula, while an 
input chosen according to Dn is always 1/ A- far from satisfying it. 

Proof. This follows immediately from the construction and Lemma 17.21 ■ 
7.3 Proving non-testability 

We use here the following common application of Yao's method (see e.g. [6]). 

Lemma 7.7 If Dy is a distribution over satisfying inputs and Dn a distribution over e-far ones, 
such that for any fixed set of queries Q with \Q\ < I the probability distributions over the outcomes 
differ by less than | (in the variation distance norm) for Dy and Djy, then there is no non-adaptive 
e-test for the property that makes at most I queries (1-sided or 2-sided). 

This allows us to conclude the proof. 

Theorem 7.8 Testing for being a satisfying assignment of the balancing formula of height h re- 
quires at least £l(h) queries for a non-adaptive test and Q(\ogh) queries for a possibly adaptive 
one. 

Proof. We note that for any set of queries Q, the size of the set of lowest common ancestors 
(outside Q itself) is less than Q, and hence (in the notation of Lemma 17. 5[) we have \H\ < \Q\. 
Now if \Q\ = o(h), then the event of Lemma 17.51 happens with probability 1 — o(l), and hence the 
variation distance between the two (unconditional) distributions over outcomes is o(l). Together 
with Lemma 17.61 this fulfills the conditions for Lemma 17.71 for concluding the proof. 

For adaptive algorithms the bound follows by the standard procedure that makes an adaptive 
algorithm into a non-adaptive one at an exponential cost (by querying in advance the algorithm's 
entire decision tree given its internal coin tosses). ■ 
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